Statistical Voice Conversion Techniques for Alaryngeal Speech Enhancement
Abstract
This position paper gives a brief overview of the technologies we have developed for enhancing alaryngeal speech (AL speech) uttered by laryngectomees. Laryngectomees can produce AL speech using several alternative speaking methods. However, every type of AL speech suffers from a lack of naturalness and speaker individuality (identity). To address this issue, we have developed statistical voice conversion techniques for AL speech enhancement. These techniques convert AL speech into normal speech in a probabilistic manner on the basis of statistics extracted from training data consisting of utterance pairs of AL speech and normal speech. They can also flexibly control the voice quality of the enhanced speech to effectively recover speaker individuality. We have developed three AL speech enhancement systems, for 1) esophageal speech, 2) electrolaryngeal speech, and 3) body-conducted silent electrolaryngeal speech. Our experimental results have demonstrated that these systems yield significant improvements in the naturalness and speaker individuality of each type of AL speech.
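The abstract does not say which statistical model is used, so the following is only a minimal sketch of the general idea of learning a mapping from paired training data. It assumes, purely for illustration, one scalar feature per frame, frames that are already time-aligned, and a single joint Gaussian over (source, target) pairs; the function names `fit_joint_gaussian` and `convert` and the synthetic data are hypothetical and not taken from the paper. Conversion is the conditional mean of the target feature given the source feature.

```python
import random


def fit_joint_gaussian(src, tgt):
    """Estimate joint statistics from aligned (source, target) feature pairs."""
    n = len(src)
    mu_x = sum(src) / n
    mu_y = sum(tgt) / n
    var_x = sum((x - mu_x) ** 2 for x in src) / n
    cov_xy = sum((x - mu_x) * (y - mu_y) for x, y in zip(src, tgt)) / n
    return mu_x, mu_y, var_x, cov_xy


def convert(x, stats):
    """Return the conditional mean E[target | source = x] under the joint Gaussian."""
    mu_x, mu_y, var_x, cov_xy = stats
    return mu_y + cov_xy / var_x * (x - mu_x)


# Synthetic stand-in for paired training data (assumption: no real speech features here).
random.seed(0)
src = [random.gauss(0.0, 1.0) for _ in range(500)]
tgt = [2.0 * x + 0.5 + random.gauss(0.0, 0.01) for x in src]

stats = fit_joint_gaussian(src, tgt)
converted = [convert(x, stats) for x in src]

# Root-mean-square error between converted and target features on the training pairs.
rmse = (sum((c - t) ** 2 for c, t in zip(converted, tgt)) / len(tgt)) ** 0.5
print(f"RMSE on training pairs: {rmse:.4f}")
```

A practical system would use multi-dimensional spectral features and a richer model than a single Gaussian; the sketch only shows how statistics extracted from utterance pairs can define a probabilistic conversion function.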
Related resources
Speaking-Aid Systems Based on One-to-Many Eigenvoice Conversion for Total Laryngectomees
This paper proposes speaking-aid systems based on one-to-many eigenvoice conversion (EVC) for enhancing three types of alaryngeal speech: esophageal speech; electrolaryngeal speech; and body-conducted silent electrolaryngeal speech. Although alaryngeal speech allows laryngectomees to utter speech sounds, it suffers from lack of naturalness and speaker individuality. To improve the sound quality...
A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion
In this paper, we present a digital signal processor (DSP) implementation of real-time statistical voice conversion (VC) for silent speech enhancement and electrolaryngeal speech enhancement. As a silent speech interface, we focus on nonaudible murmur (NAM), which can be used in situations where audible speech is not acceptable. Electrolaryngeal speech is one of the typical types of alaryngeal ...
Quality Estimation of Alaryngeal Speech
Quality assessment can be done using subjective listening tests or using objective quality measures. Objective measures quantify quality. The sentence material is chosen from the IEEE corpus. Real-world noise data was taken from the noisy speech corpus NOIZEUS. The alaryngeal speaker's voice (alaryngeal speech) is recorded. To enhance the quality of speech produced from the prosthetic device,...
Application of speech conversion to alaryngeal speech enhancement
Two existing speech conversion algorithms were modified and used to enhance alaryngeal speech. The modifications were aimed at reducing spectral distortion (bandwidth increase) in a vector-quantization (VQ) based system and the spectral discontinuity in a linear multivariate regression (LMR) based system. Spectral distortion was compensated for by formant enhancement using chirp z-transform and...
Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems
This article examines methods for optimizing the performance of GMM-based voice conversion systems, taking the GMM method as the baseline to be improved. In current methods, because a single conversion function is used to convert all speech units, and because statistical averaging then smooths the spectrum, we will observe quality ...
Publication date: 2013